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pq . Abstract 

Qh' We consider worst case time bounds for NP-complete problems including 3-S AT, 3-coloring, 3-edge- 

^ , coloring, and 3-list-coloring. Our algorithms are based on a constraint satisfaction (CSP) formulation of 

these problems; 3-SAT is equivalent to (2, 3)-CSP while the other problems above are special cases of 

CO , (3, 2)-CSP. We give a fast algorithm for (3, 2)-CSP and use it to improve the time bounds for solving the 

other problems listed above. Our techniques involve a mixture of Davis-Putnam-style backtracking with 

more sophisticated matching and network flow based ideas. 



X 



1 Introduction 

There has recently been growing interest in analysis of superpolynomial-time algorithms, including algo- 
rithms for NP-hard problems such as satisfiability or graph coloring. This interest has multiple causes: 
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^ ■ • Many important applications can be modeled with these problems, and with the increased speed of 

f— N ', modern computers, solved effectively; for instance it is now routine to solve hard 500- variable sat- 

G^ ' isfiability instances, and structured instances with up to 10000 variables can often be handled in 

o ■ 

O 

--^ . • Improvements in exponential time bounds are especially critical in determining the size of problems 

Q I that can be solved: an improvement from 0{2^'^") to 0(2'^2") implies a factor ofc\/c2 improvement in 

the solvable problem size while technological developments can only improve the size by an additive 
constant. 



practice [24] 



• Approximation algorithms for many of these problems are often either nonsensical (how does one 
approximate SAT?) or inadequate. 

• The large gap between the known theoretical worst case bounds and results from empirical testing of 
implementations provides an interesting challenge to algorithm researchers. 

In this paper we continue our previous work on exact algorithms for 3-coloring, 3-edge-coloring, and 
3-SAT 1^. Each of these problems can be expressed as a form of constraint satisfaction (CSP). We solve 
instances of CSP with at most two variables per constraint by showing that such an instance either contains 
a good local configuration allowing us to split the problem into several smaller instances, or can be solved 
directly by graph matching. We solve graph 3-coloring by using techniques including network flow to find a 
small set of vertices with many neighbors, choosing colors for that set of vertices, and treating the remaining 
problem using our constraint satisfaction algorithm. We solve graph 3-edge-coloring by a further level of 
case analysis: we use graph matching to find a large set of good local configurations, each of which can be 
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applied independently yielding a set of instances of a generalized edge coloring problems in which certain 
pairs of edges are constrained to have distinct colors. We then solve this generalized coloring problem with 
our vertex coloring algorithm. 

Beyond simply adding additional case analysis, the improvements in the present work stem from the 
following new ideas: 

• Extending our previous (3,2)-CSP algorithm to (4,2)-CSP, and measuring the size of a (4,2)-CSP 
instance in terms of a parameter e which can be varied to achieve the optimal tradeoff between different 
cases in the analysis. 

• Stopping the search when a CSP instance can be solved by a graph matching algorithm, rather than 
continuing the case analysis of an instance until it can be determined directly to be solvable or un- 
solvable. 

• Eliminating cycles of low-degree vertices from vertex coloring instances, in order to show that a large 
fraction of the graph can be covered by the neighborhoods of few high degree vertices. 

• Using network flow techniques to cover a vertex coloring instance with a forest that avoids certain bad 
kinds of trees. Our previous paper instead performed a similar step using a complicated case analysis 
in place of network flow, and achieved weaker limitations on the types of trees occurring in the forest. 

• Introducing a generalization of edge coloring, so that we can perform reductions while staying in the 
same problem class before treating the problem as an instance of vertex coloring, and using graph 
matching to find many independent good local configurations in an edge coloring instance. 

We omit most of the case analysis and proofs in this extended abstract. For details see the full paper [^, 
which combines these new results with those from our previous conference paper ^. 

1.1 New Results 

We show the following: 

• A (3, 2)-CSP instance with n variables can be solved in worst case time (9(1.3645"), independent of 
the number of constraints. We also give a very simple randomized algorithm for solving this problem 
in expected time 0{n^^^h"/^) ^ 0(1.4142"). 

• A {d, 2)-CSP instance with n variables and d > 3 can be solved by a randomized algorithm in expected 

time ©( (0.45 18 J)"). 

• 3-coloring in a graph of n vertices can be solved in time 0(1.3289"), independent of the number of 
edges in the graph. 

• 3-list-coloring (graph coloring given a list at each vertex of three possible colors chosen from some 
larger set) can be solved in time 0(1.3645"), independent of the number of edges. 

• 3 -edge-coloring in an n- vertex graph can be solved in time 0(2"/^), again independent of the number 
of edges. 

• 3-satisfiability of a formula with t 3-clauses can be solved in time 0{rP'^^^ + 1.3645^), independent 
of the number of variables or 2-clauses in the formula. 

Except where otherwise specified, n denotes the number of vertices in a graph or variables in a SAT or 
CSP instance, while m denotes the number of edges in a graph, constraints in an CSP instance, or clauses in 
a SAT problem. 



1.2 Related Work 



For three-coloring, we know of several relevant references. Lawler [15] is primarily concerned with the 
general chromatic number, but also gives a simple 0(1. 4422") -time algorithm for 3-coloring: for each 
maximal independent set, test whether the complement is bipartite. Schiermeyer [ p2| ] improves this to 
0(1.415"), and our previous conference paper [§] further reduced the time bound to 0(1.3443"). Our 
0(1.3289") bound significantly improves all of these results. 

There has also been some related work on approximate or heuristic 3-coloring algorithms. Blum and 
Karger [Q] show that any 3-chromatic graph can be colored with 0{n^'^^) colors in polynomial time. Alon 
and Kahale ^ describe a technique for coloring random 3-chromatic graphs in expected polynomial time, 
and Petford and Welsh [jT^ present a randomized algorithm for 3-coloring graphs which also works well 



empirically on random graphs although they prove no bounds on its running time. Finally, Vlasie [ 25 1 has 
described a class of instances which are (unlike random 3-chromatic graphs) difficult to color. 

Several authors have described exact algorithms for Boolean formula satisfiability ||5|-^^, 10, 12, 13, 
16-18, 20, 21 1 . Very recently, Schoning [23] has described a simple and powerful randomized algorithm 
for /:-SAT and more general constraint satisfaction problems, including the CSP instances that we use in 
our solution of 3-coloring. For 3-SAT, Schoning's algorithm takes expected time 0((4/3 + e)") However, 
for {d, 2)-CSP, Schoning notes that his method is not as good as a randomized approach based on an idea 
from our previous conference paper [§] : simply choose a random pair of values for each variable and solve 
the resulting 2-SAT instance in polynomial time, giving an overall bound of 0{{d/2)"n'~'^^'). Feder and 
Motwani [Q] have an alternative randomized algorithm which takes time 0(((i!)"/^«^'^'^), an improvement 
over Schoning for (i > 6 (and over our results ford> 11). The table below compares these bounds with 
our new results; an entry with value x in column d indicates a time bound of 0{x^n'~^^^^) for {d, 2)-CSP. 
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The only prior work we found for 3-edge coloring was our own 0(1.5039") bound [^. Since any 3- 
edge-chromatic graph has at most 3?2/2 edges, one can transform the problem to 3-vertex-coloring at the 
expense of increasing « by a factor of 3/2. If we applied our vertex coloring algorithm we would then get 
time 0(1.5319"). Both of these bounds are significantly improved by the one we state. 



It is interesting that, historically, until the work of Schoning [23], the time bounds for 3-coloring have 
been smaller than those for 3-satisfiability (in terms of the number of vertices or variables respectively). 
Schoning's 0((4/3 + e)") time bound for 3-SAT reversed this pattern by being smaller than the previous 
0(1.3443") bound for 3-coloring from our previous paper ^. The present work restores 3-coloring to a 
smaller time bound than 3-SAT. 



2 Constraint Satisfaction Problems 



We now describe a common generalization of satisfiability and graph coloring as a constraint satisfaction 
problem (CSP) [ [l4[^ . We are given a collection of n variables, each of which has a list of possible colors 
allowed. We are also given a collection of m constraints, consisting of a tuple of variables and a color for 
each variable. A constraint is satisfied by a coloring if not every variable in the tuple is colored in the way 
specified by the constraint. We would like to choose one color from the allowed list of each variable, in a 
way not conflicting with any constraints. 









Figure 1: Example (3,2)-CSP instance with five variables and twenty constraints (left), and a solution of 
the instance (right). 





Figure 2: Example 3 -coloring instance (left) and translation into a (3, 2)-CSP instance (right). 



For instance, 3-satisfiability can easily be expressed in this form. Each variable of the satisfiability 
problem may be colored (assigned the value) either true (T) ot false (F). For each clause like (jcj Va'2 V -■^1:3), 
we make a constraint ((vi,F), (v2,F), (v3, T)). Such a constraint is satisfied if and only if at least one of the 
corresponding clause's terms is true. 

In the (a, b)-CSP problem, we restrict our attention to instances in which each variable has at most a 
possible colors and each constraint involves at most b variables. The CSP instance constructed above from a 
3-SAT instance is then a (2, 3)-CSP instance, and in fact 3-SAT is easily seen to be equivalent to (2, 3)-CSP. 

In this paper, we will concentrate our attention instead on (3, 2)-CSP and (4, 2)-CSP. We can represent 
a {d,2)-CSF instance graphically, by interpreting each variable as a vertex containing up to d possible 
colors, and by drawing edges connecting incompatible pairs of vertex colors (Figure |T]). Note that this 
graphical structure is not actually a graph, as the edges connect colors within a vertex rather than the vertices 
themselves. However, graph 3-colorability and graph 3-list-colorability can be translated directly to a form 
of (3,2)-CSP: we keep the original vertices of the graph and their possible colors, and add up to three 
constraints for each edge of the graph to enforce the condition that the edge's endpoints have different 
colors (Figure ^. 

Of course, since these problems are all NP-complete, the theory of NP-completeness provides transla- 
tions from one problem to the other, but the translations above are size-preserving and very simple. Our 
graph coloring techniques include more complicated translations in which the input graph is partially col- 
ored before treating the remaining graph as an CSP instance, leading to improved time bounds over our pure 
CSP algorithm. 





Figure 3: Isolated constraint between two three-color variables (left) is equivalent to a single four-color 
variable (right). 

3 Constraint Satisfaction Algorithm 

We now outline our (4,2)-CSP algorithm. A (4,2)-CSP instance can be transformed into a (3,2)-CSP 
instance by expanding its four-color variables to two three-color variables (Figure 0), so a natural definition 
of the "size" of a (4, 2)-CSP instance is « = «3 + 2«4, where rij denotes the number of /-color variables. 
However, we instead define the size asn = n^ + {2 — e)n^, where e is a constant to be determined later. The 
size of a (3, 2)-CSP instance remains equal to its number of variables, so any bound on the running time of 
our algorithm in terms of n applies directly to (3, 2)-CSP. 

Our basic idea is to find a set of local configurations that must occur within any (4, 2)-CSP instance /. 
For each configuration we describe a set of smaller instances /; of size |/| — r,- such that / is solvable if and 
only if at least one of the instances /, is solvable. If one particular configuration occurred at each step of the 
algorithm, this would lead to a recurrence of the form 

T{n) = Y.T{n- n) + poly(n) = 0(A(ri, r2, . . .)") 

for the runtime of our algorithm, where A(ri, r2, . . .) is the largest zero of the function /(jc) = 1 — Yl,^~'^'- 
We call this value A(ri , r2, . . .) the work factor of the given local configuration. The overall time bound will 
be A" where A is the largest work factor among the configurations we identify. 

We first consider local configurations in which some (variable,color) pair (v, R) is involved in only a 
single constraint ((v, 7?), (w,/?)). If this is also the only constraint involving {w,R), and both v and w have 
three colors, they can be replaced by a single four-color variable (Figure ||); any other singly-constrained 
color leads to a problem reduction with work factor A(2 — e,3— e). 

We next find colors with multiple constraints to different colors of the same variable, and show that 
such a case has work factor A(2 — e,3— 2e). The next case we consider involves colors constrained by four 
or more neighboring variables, or four-color variables with a color constrained by three variables. In these 
cases, choosing to use or not use the highly-constrained color gives work factor A(l — e, 5 — 4e). Instances 
in which none of the above cases applies have a special form: each (variable,color) pair has exactly two or 
three constraints, which must involve distinct variables. 

Our next sequence of cases concerns adjacency between (variable,color) pairs with two constraints and 
pairs with three constraints. We show that, if (v,/?) has three constraints, one of which connects it to a 
variable with four color choices, then the instance can be replaced by smaller instances with work factor 
at most A(3— e,4 — e,4 — e). If this case does not apply, and a three-constraint pair (v,/?) is adjacent 
to a (variable,color) pair with two constraints, then we have additional cases with work factor at most 
max{A(l + e,4), A(3,4 — e,4)}. If none of these cases applies to an instance, then each color choice 
in the instance must have either two or three constraints, and each neighbor of that choice must have the 
same number of constraints. 

We now consider the remaining (variable,color) pairs that have three constraints each. Define a three- 
component to be a subset of such pairs such that any pair in the subset is connected to any other by a path 



of constraints. We distinguish two such types of components: a small three-component is one that involves 
only four distinct variables, while a large three-component involves five or more variables. A small three- 
component is good if it involves only four (variable,color) pairs. We show that an instance containing a small 
three-component that is not good can be replaced by smaller instances with work factor at most A(4,4,4), 
and that an instance containing a large three-component can be replaced by smaller instances with work 
factor at most A(4, 4, 5, 5). As a consequence, we can assume all remaining three-components are good. 

Finally, we define a two-component to be a subset of (variable,color) pairs such that each has two con- 
straints, and any pair in the subset is connected to any other by a path of constraints. Our analysis of 
two-components is essentially the same as in [||], and shows that unless a two-component forms a triangle, 
the instance can be replaced by smaller instances with work factor at most A(3, 3, 5). 

Suppose we have a (4, 2)-CSP instance to which none of the preceding reduction cases applies. Then, 
every constraint must be part of a good three-component or a triangular two-component. As we now show, 
this simple structure enables us to solve the remaining problem quickly. 

Lemma \ If we are given a {4,2)-CSP instance in which every constraint must be part of a good three- 
component or a small two-component, then we can solve it or determine that it is not solvable in polynomial 
time. 

Proof: We form a bipartite graph, in which the vertices correspond to the variables and components of 
the instance. We connect a variable to a component by an edge if there is a (variable,color) pair using that 
variable and belonging to that component. The instance is solvable iff this graph has a matching covering 
all variables. □ 

This completes the analysis needed for our result. 

Theorem 1 We can solve any (3, 2)-CSP instance in time C(A(4, 4, 5, 5)") « 0(1.36443"). 

Proof: We employ a backtracking (depth first) search in a state space consisting of (3,2)-CSP instances. 
At each step we examine the current state, match it to one of the cases above, and recursively search each 
smaller instance. If we reach an instance in which Lemma |l] applies, we perform a matching algorithm and 
either stop with a solution or backtrack to the most recent branching point of the search and continue with 
the next alternative. 

A bound of A" on the number of recursive calls in this search algorithm, where A is the maximum 
work factor occurring in our reduction lemmas, can be proven by induction on the size of an instance. To 
determine the maximum work factor, we need to set the parameter e. We used Mathematica to optimize e 
numerically, and found that for e ^ 0.095543 the work factor is ?» 1.36443 ^ A(4,4, 5, 5). For e near this 
value, the largest work factors involving eare A(3— e,4 — e,4 — e), and A(l+e,4); the remaining work factors 
are below 1.36. The true optimum value of e is thus the one for which A(3— e, 4 — e, 4 — e) = A(l + e, 4). 

As we now show, for this optimum e, A(3— e,4 — e,4 — e) = A(l + e,4) = A(4,4, 5,5), which also 
arises as a work factor in another case. Consider subdividing an instance of size n into one of size « — (1 + e) 
and another of size n — 4, and then further subdividing the first instance into subinstances of size « — ( 1 + 
e) — (3— e), « — (1 + e) — (4 — e), and « — (1 + e) — (4 — e). This four-way subdivision has work factor 
A(4,4,5,5), and combines subdivisions of type A(l + e,4) and A(3— e,4 — e,4 — e), so these three work 
factors must be equal. □ 

We use the quantity A(4,4,5,5) frequently in the remainder of the paper, so we use A to denote this 
value. Theorem [I| immediately gives algorithms for some more well known problems, some of which we 
improve later. Of these, the least familiar is likely to be list k-coloring: given at each vertex of a graph a 
list of k colors chosen from some larger set, find a coloring of the whole graph in which each vertex color is 



chosen from the corresponding list [11] 



Corollary 1 We can solve the 3-colormg and 3-list coloring problems in time 0{A"), the 3-edge-coloring 
problem in time 0{A'^), and the 3-SAT problem in time 0{A'), 



Corollary 2 There is a randomized algorithm which finds the solution to any solvable {d, 2)-CSP instance 
(with d > 3) in expected time C'((0.4518(3f)"). 

Proof: Randomly choose a subset of four values for each variable and apply our algorithm to the resulting 
(4, 2)-CSP problem. Repeat with a new random choice until finding a solvable (4, 2)-CSP instance. The 
random restriction of a variable has probability 4/d of preserving solvability so the expected number of 
trials is (<i/4)". Each trial takes time ©(A^^^^)") k, 0(1.8072"). The total expected time is therefore 

0((rf/4)" 1.8072"). □ 



4 Vertex Coloring 

Simply by translating a 3-coloring problem into a (3,2)-CSP instance, as described above, we can test 3- 
colorability in time 0(A"). We reduce this further (as in pp) by finding a small set of vertices S C V{G) 
with a large set A^ of neighbors, and choosing one of the 3^' colorings for all vertices in S. For each such 
coloring, we translate the remaining problem to a (3, 2)-CSP instance. The vertices in S are already colored 
and need not be included in the (3,2)-CSP instance, and the vertices in N can also be eliminated, so the 
overall time is 0{3^^^ h}^^^^~^~^^). By choosing S appropriately we can make this quantity smaller than 
0(A"). 

We begin by showing a cycle of degree-three vertices allows us to reduce a 3-coloring instance to smaller 
instances with work factor A (5, 6, 7, 8) k. 1 .2433, and that a tree of eight or more such vertices leads to work 
factor A(2, 5, 6) « 1.3247. Therefore, we can assume that the degree-three vertices form trees of at most 
seven vertices, and that the graph contains many higher degree vertices. 

We define a bushy forest to be an unrooted forest within a given instance graph, such that each internal 
node has degree four or more. Because each tree of k degree-three vertices must have at least 9k /I outgoing 
edges, we can show that the number of vertices excluded from a maximal bushy forest is at most 20r/3, 
where r is its number of leaves. 

All internal nodes of the maximal bushy forest F will be included in S, but we also wish to include some 
of the remaining graph vertices. To do this, we form these vertices into trees of height two, with at most five 
grandchildren. Further, in a tree with four or more grandchildren, at least one node must have degree four or 
more in G. This forest of height-two trees can be found by the following flow -based technique (Figure ^: 
first, find a set of disjoint ^13 subgraphs in G\F, maximal under operations that remove one such subgraph 
and form two or more from the remaining vertices. Assign grandchildren to these height-one trees, from the 
remaining vertices nonadjacent to F, with fractional weights spread evenly among the trees each child can 
be assigned to. This fractional assignment can be shown to have the bounds we want on the total assigned 
weight of grandchildren per tree, but does not form a disjoint set of trees. However, we can represent the 
possible assignments of grandchildren to trees using a flow graph, and use the fact that every flow problem 
has an optimal integer solution, to find a non-fractional assignment of grandchildren to trees with the same 
bounds. We will add to S certain tree roots or their children, depending on the shape of each height-two tree. 

Theorem 2 We can solve the 3-coloring problem in time C)((23/4934/49^24/49yi) ^ 1.3289". 

Proof: As described, we find a maximal bushy forest F, then cover the remaining vertices by height-two 
trees. We choose colors for each internal vertex in F, and for certain vertices in the height-two trees. Vertices 






Figure 4: Use of maximum flow to find a good height-two forest. Top: forest of /Tj 3 subgraphs and adjacent 
vertices. The left subgraph is adjacent to the maximal bushy forest; the other two subgraphs aie not. Top 
middle: flow graph and fractional flow formed by spreading flow equally from each of the bottom vertices. 
The edge capacities are all one, except for the top three which are respectively 5, 3, and 3. Bottom middle: 
maximum integer flow for the same flow graph. Bottom: height-two forest corresponding to the given 
integer flow. 





Figure 5: Replacement of five edges (left) by two constrained edges (right). 

adjacent to these colored vertices are restricted to two colors, while the remaining vertices form a (3, 2)-CSP 
instance and can be colored using our general (3, 2)-CSP algorithm. 

Let/7 denote the number of vertices that are roots in F; ^ denote the number of non-root internal vertices; 
r denote the number of leaves of F; s denote the number of vertices adjacent to leaves of F; and t denote 
the number of remaining vertices, which must all be degree-three vertices in the height-two forest. We show 
that, if we assign cost A to each vertex adjacent to a leaf of F, then the cost of coloring each height-two 
tree, averaged over its remaining vertices, is at most (3A^)^/^ per vertex. Therefore, the total time for the 
algorithm is at most 3''2^A-*(3A3)'/'^. 

This bound is subject to the constraints p,q,r,s,t >0, p + q + r + s + t = n,4p + 2q < r,2r > s, and 
20r/3 > s + t. The worst case occurs when s + t = 20r/3, s = 2r, t = 14r/3, 4p + 2q = r, p = 0, and 
r = 2q, giving the stated bound. □ 



5 Edge Coloring 

We now describe an algorithm for finding edge colorings of undirected graphs, using at most three colors, 
if such colorings exist. We can assume without loss of generality that the graph has vertex degree at most 
three. Then m < 3n/2, so by applying our vertex coloring algorithm to the line graph of G we could achieve 
time bound 1.3289^"'^ « 1.5319". Just as we improved our vertex coloring algorithm by performing some 
reductions in the vertex coloring model before treating the problem as a (3,2)-CSP instance, we improve 
this edge coloring bound by performing some reductions in the edge coloring model before treating the 
problem as a vertex coloring instance. 

The main idea is to solve a problem intermediate in generality between 3 -edge-coloring and 3-vertex- 
coloring: 3-edge-coloring with some added constraints that certain pairs of edges should not be the same 
color. 

Lemma 2 Suppose a constrained 3-edge-coloring instance contains an unconstrained edge connecting two 
degree-three vertices. Then the instance can be replaced by two smaller instances with three fewer edges 
and two fewer vertices each. 

This reduction operation is depicted in Figure §. 

We let mj, denote the number of edges with three neighbors in an unconstrained 3-edge-coloring instance, 
and /M4 denote the number of edges with four neighbors. Edges with fewer neighbors can be removed at 
no cost, so we can assume without loss of generality that m = mj, -\- m^. By using a maximum matching 
algorithm, we can find a set of m^./^ edges such that Lemma | can be applied independently to each edge. 



Theorem 3 We can 3-edge-color any 3-edge-colorable graph, in time 0(2"' ). 

Proof: We apply Lemma ^m^/3 times, resulting in a set of 2'"4/3 constrained 3 -edge-coloring problems 
each having only m^ edges. We then treat these remaining problems as 3-vertex-coloring problems on the 
corresponding line graphs, augmented by additional edges representing the constraints added by Lemma §. 
The time for this algorithm is thus at most 0(1.3289"'3 2"'4/3). This is maximized when m4 = 3?i/2 and 
m3 = 0. □ 
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